HTIMIT and LLHDB: speech corpora for the study of handset transducer effects

نویسنده

  • Douglas A. Reynolds
چکیده

This paper describes two corpora collected at Lincoln Laboratory for the study of handset transducer e ects on the speech signal: the handset TIMIT (HTIMIT) corpus and the Lincoln Laboratory Handset Database (LLHDB). The goal of these corpora are to minimize all confounding factors and produce speech predominately di ering only in handset transducer e ects. The speech is recorded directly from a telephone unit in a sound-booth using prompted text and extemporaneous photograph descriptions. The two corpora allow comparison of speech collected from a person speaking into a handset (LLHDB) versus speech played through a loudspeaker into a handset (HTIMIT). A comparison of analysis and results between the two corpora will address the realism of arti cially creating handset degraded speech by playing recorded speech through handsets. The corpora are designed primarily for speaker recognition experimentation (in terms of amount of speech and level of transcription), but since both speaker and speech recognition systems operate on the same acoustic features a ected by the handset, knowledge gleaned is directly transferable to speech recognizers. Initial speaker identi cation performance on these corpora are presented. In addition, the application of HTIMIT in developing a handset detector that was successfully used on a Switchboard speaker veri cation task is described.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Divergence-based out-of-class rejection for telephone handset identification

Research has shown that handset selectors can be used to assist telephone-based speech/speaker recognition. Most handset selectors, however, simply select the most likely handset from a set of known handsets even for speech coming from an ‘unseen’ handset. This paper proposes a divergence-based handset selector with out-of-handset (OOH) rejection capability to identify the ‘unseen’ handsets. Th...

متن کامل

Divergence-based Out-of-class Reject Identificatio

Research has shown that handset selectors can be used to assist telephone-based speech/speaker recognition. Most handset selectors, however, simply select the most likely handset from a set of known handsets even for speech coming from an ‘unseen’ handset. This paper proposes a divergence-based handset selector with out-of-handset (OOH) rejection capability to identify the ‘unseen’ handsets. Th...

متن کامل

Environment adaptation for robust speaker verification

In speaker verification over public telephone networks, utterances can be obtained from different types of handsets. Different handsets may introduce different degrees of distortion to the speech signals. This paper attempts to combine a handset selector with (1) handset-specific transformations and (2) handset-dependent speaker models to reduce the effect caused by the acoustic distortion. Spe...

متن کامل

Unseen handset mismatch compensation based on a priori knowledge interpolation for robust speaker recognition

Unseen handset mismatch is the major source of performance degradation for speaker recognition in telecommunication environment since handset distortions are tightly coupled with speaker characteristics. In this paper, a soft-decision unseen handset characteristics estimation method based on a priori knowledge interpolation is proposed to decouple the characteristics of the unseen handset and s...

متن کامل

Soft-decision a Priori Knowledge Interpolation for Robust Telephone Speaker Identification

Handsets which are not seen in the training phase (a.k.a unseen handsets) are main sources of performance degradation for speaker identification (SID) applications in telecommunication environments. To alleviate the problem, a soft-decision a priori knowledge interpolation (SD-AKI) method of handset characteristic estimation for handset mismatch-compensated SID is proposed in this paper. The id...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997